Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 124
Filter
1.
Sci Data ; 11(1): 465, 2024 May 08.
Article in English | MEDLINE | ID: mdl-38719810

ABSTRACT

Myriad policy, ethical and legal considerations underpin the sharing of biological resources, implying the need for standardised and yet flexible ways to digitally represent diverse 'use conditions'. We report a core lexicon of terms that are atomic, non-directional 'concepts of use', called Common Conditions of use Elements. This work engaged biobanks and registries relevant to the European Joint Programme for Rare Diseases and aimed to produce a lexicon that would have generalised utility. Seventy-six concepts were initially identified from diverse real-world settings, and via iterative rounds of deliberation and user-testing these were optimised and condensed down to 20 items. To validate utility, support software and training information was provided to biobanks and registries who were asked to create Sharing Policy Profiles. This succeeded and involved adding standardised directionality and scope annotations to the employed terms. The addition of free-text parameters was also explored. The approach is now being adopted by several real-world projects, enabling this standard to evolve progressively into a universal basis for representing and managing conditions of use.


Subject(s)
Biological Specimen Banks , Humans , Information Dissemination , Registries
2.
Sci Data ; 11(1): 464, 2024 May 08.
Article in English | MEDLINE | ID: mdl-38719839

ABSTRACT

Improving patient care and advancing scientific discovery requires responsible sharing of research data, healthcare records, biosamples, and biomedical resources that must also respect applicable use conditions. Defining a standard to structure and manage these use conditions is a complex and challenging task. This is exemplified by a near unlimited range of asset types, a high variability of applicable conditions, and differing applications at the individual or collective level. Furthermore, the specifics and granularity required are likely to vary depending on the ultimate contexts of use. All these factors confound alignment of institutional missions, funding objectives, regulatory and technical requirements to facilitate effective sharing. The presented work highlights the complexity and diversity of the problem, reviews the current state of the art, and emphasises the need for a flexible and adaptable approach. We propose Digital Use Conditions (DUC) as a framework that addresses these needs by leveraging existing standards, striking a balance between expressiveness versus ambiguity, and considering the breadth of applicable information with their context of use.


Subject(s)
Information Dissemination , Humans
4.
Front Neurol ; 14: 1187095, 2023.
Article in English | MEDLINE | ID: mdl-37545729

ABSTRACT

Efficient data sharing is hampered by an array of organizational, ethical, behavioral, and technical challenges, slowing research progress and reducing the utility of data generated by clinical research studies on neurodegenerative diseases. There is a particular need to address differences between public and private sector environments for research and data sharing, which have varying standards, expectations, motivations, and interests. The Neuronet data sharing Working Group was set up to understand the existing barriers to data sharing in public-private partnership projects, and to provide guidance to overcome these barriers, by convening data sharing experts from diverse projects in the IMI neurodegeneration portfolio. In this policy and practice review, we outline the challenges and learnings of the WG, providing the neurodegeneration community with examples of good practices and recommendations on how to overcome obstacles to data sharing. These obstacles span organizational issues linked to the unique structure of cross-sectoral, collaborative research initiatives, to technical issues that affect the storage, structure and annotations of individual datasets. We also identify sociotechnical hurdles, such as academic recognition and reward systems that disincentivise data sharing, and legal challenges linked to heightened perceptions of data privacy risk, compounded by a lack of clear guidance on GDPR compliance mechanisms for public-private research. Focusing on real-world, neuroimaging and digital biomarker data, we highlight particular challenges and learnings for data sharing, such as data management planning, development of ethical codes of conduct, and harmonization of protocols and curation processes. Cross-cutting solutions and enablers include the principles of transparency, standardization and co-design - from open, accessible metadata catalogs that enhance findability of data, to measures that increase visibility and trust in data reuse.

5.
Lancet Public Health ; 8(7): e535-e545, 2023 Jul.
Article in English | MEDLINE | ID: mdl-37393092

ABSTRACT

BACKGROUND: To inform targeted public health strategies, it is crucial to understand how coexisting diseases develop over time and their associated impacts on patient outcomes and health-care resources. This study aimed to examine how psychosis, diabetes, and congestive heart failure, in a cluster of physical-mental health multimorbidity, develop and coexist over time, and to assess the associated effects of different temporal sequences of these diseases on life expectancy in Wales. METHODS: In this retrospective cohort study, we used population-scale, individual-level, anonymised, linked, demographic, administrative, and electronic health record data from the Wales Multimorbidity e-Cohort. We included data on all individuals aged 25 years and older who were living in Wales on Jan 1, 2000 (the start of follow-up), with follow-up continuing until Dec 31, 2019, first break in Welsh residency, or death. Multistate models were applied to these data to model trajectories of disease in multimorbidity and their associated effect on all-cause mortality, accounting for competing risks. Life expectancy was calculated as the restricted mean survival time (bound by the maximum follow-up of 20 years) for each of the transitions from the health states to death. Cox regression models were used to estimate baseline hazards for transitions between health states, adjusted for sex, age, and area-level deprivation (Welsh Index of Multiple Deprivation [WIMD] quintile). FINDINGS: Our analyses included data for 1 675 585 individuals (811 393 [48·4%] men and 864 192 [51·6%] women) with a median age of 51·0 years (IQR 37·0-65·0) at cohort entry. The order of disease acquisition in cases of multimorbidity had an important and complex association with patient life expectancy. Individuals who developed diabetes, psychosis, and congestive heart failure, in that order (DPC), had reduced life expectancy compared with people who developed the same three conditions in a different order: for a 50-year-old man in the third quintile of the WIMD (on which we based our main analyses to allow comparability), DPC was associated with a loss in life expectancy of 13·23 years (SD 0·80) compared with the general otherwise healthy or otherwise diseased population. Congestive heart failure as a single condition was associated with mean a loss in life expectancy of 12·38 years (0·00), and with a loss of 12·95 years (0·06) when preceded by psychosis and 13·45 years (0·13) when followed by psychosis. Findings were robust in people of older ages, more deprived populations, and women, except that the trajectory of psychosis, congestive heart failure, and diabetes was associated with higher mortality in women than men. Within 5 years of an initial diagnosis of diabetes, the risk of developing psychosis or congestive heart failure, or both, was increased. INTERPRETATION: The order in which individuals develop psychosis, diabetes, and congestive heart failure as combinations of conditions can substantially affect life expectancy. Multistate models offer a flexible framework to assess temporal sequences of diseases and allow identification of periods of increased risk of developing subsequent conditions and death. FUNDING: Health Data Research UK.


Subject(s)
Diabetes Mellitus , Heart Failure , Psychotic Disorders , Male , Humans , Female , Adult , Middle Aged , Aged , Semantic Web , Multimorbidity , Retrospective Studies , Wales/epidemiology , Diabetes Mellitus/epidemiology , Heart Failure/epidemiology , Psychotic Disorders/epidemiology , Life Expectancy
6.
Nucleic Acids Res ; 51(D1): D986-D993, 2023 01 06.
Article in English | MEDLINE | ID: mdl-36350644

ABSTRACT

The GWAS Central resource gathers and curates extensive summary-level genome-wide association study (GWAS) data and puts a range of user-friendly but powerful website tools for the comparison and visualisation of GWAS data at the fingertips of researchers. Through our continued efforts to harmonise and import data received from GWAS authors and consortia, and data sets actively collected from public sources, the database now contains over 72.5 million P-values for over 5000 studies testing over 7.4 million unique genetic markers investigating over 1700 unique phenotypes. Here, we describe an update to integrate this extensive data collection with mouse disease model data to support insights into the functional impact of human genetic variation. GWAS Central has expanded to include mouse gene-phenotype associations observed during mouse gene knockout screens. To allow similar cross-species phenotypes to be compared, terms from mammalian and human phenotype ontologies have been mapped. New interactive interfaces to find, correlate and view human and mouse genotype-phenotype associations are included in the website toolkit. Additionally, the integrated browser for interrogating multiple association data sets has been updated and a GA4GH Beacon API endpoint has been added for discovering variants tested in GWAS. The GWAS Central resource is accessible at https://www.gwascentral.org/.


Subject(s)
Databases, Genetic , Genome-Wide Association Study , Humans , Animals , Mice , Genotype , Phenotype , Data Collection , Polymorphism, Single Nucleotide , Mammals
7.
Front Neurol ; 13: 1031091, 2022.
Article in English | MEDLINE | ID: mdl-36530625

ABSTRACT

Biomarker discovery, development, and validation are reliant on large-scale analyses of high-quality samples and data. Currently, significant quantities of data and samples have been generated by European studies on Alzheimer's disease (AD) and other neurodegenerative diseases (NDD), representing a valuable resource for developing biomarkers to support early detection of disease, treatment monitoring, and patient stratification. However, discovery of, access to, and sharing of data and samples from AD and NDD research are hindered both by silos that limit collaboration, and by the array of complex requirements for secure, legal, and ethical sharing. In this Perspective article, we examine key challenges currently hampering large-scale biomarker research, and outline how the European Platform for Neurodegenerative Diseases (EPND) plans to address them. The first such challenge is a fragmented landscape filled with technical barriers that make it difficult to discover and access high-quality samples and data in one location. A second challenge is related to the complex array of legal and ethical requirements that must be navigated by researchers when sharing data and samples, to ensure compliance with data protection regulations and research ethics. Another challenge is the lack of broad-scale collaboration and opportunities to facilitate partnerships between data and sample contributors and researchers, in addition to a lack of regulatory engagement early in the research process to enable validation of potential biomarkers. A further challenge facing projects is the need to remain sustainable beyond initial funding periods, ensuring data and samples are shared and reused, thereby driving further research and innovation. In addressing these challenges, EPND will enable an environment of faster and more disruptive research on diagnostics and disease-modifying therapies for Alzheimer's disease and other neurodegenerative diseases.

8.
Hum Mutat ; 43(6): 791-799, 2022 06.
Article in English | MEDLINE | ID: mdl-35297548

ABSTRACT

Beacon is a basic data discovery protocol issued by the Global Alliance for Genomics and Health (GA4GH). The main goal addressed by version 1 of the Beacon protocol was to test the feasibility of broadly sharing human genomic data, through providing simple "yes" or "no" responses to queries about the presence of a given variant in datasets hosted by Beacon providers. The popularity of this concept has fostered the design of a version 2, that better serves real-world requirements and addresses the needs of clinical genomics research and healthcare, as assessed by several contributing projects and organizations. Particularly, rare disease genetics and cancer research will benefit from new case level and genomic variant level requests and the enabling of richer phenotype and clinical queries as well as support for fuzzy searches. Beacon is designed as a "lingua franca" to bridge data collections hosted in software solutions with different and rich interfaces. Beacon version 2 works alongside popular standards like Phenopackets, OMOP, or FHIR, allowing implementing consortia to return matches in beacon responses and provide a handover to their preferred data exchange format. The protocol is being explored by other research domains and is being tested in several international projects.


Subject(s)
Genomics , Information Dissemination , Humans , Information Dissemination/methods , Phenotype , Rare Diseases , Software
9.
Cell Genom ; 1(2): None, 2021 Nov 10.
Article in English | MEDLINE | ID: mdl-34820659

ABSTRACT

Human biomedical datasets that are critical for research and clinical studies to benefit human health also often contain sensitive or potentially identifying information of individual participants. Thus, care must be taken when they are processed and made available to comply with ethical and regulatory frameworks and informed consent data conditions. To enable and streamline data access for these biomedical datasets, the Global Alliance for Genomics and Health (GA4GH) Data Use and Researcher Identities (DURI) work stream developed and approved the Data Use Ontology (DUO) standard. DUO is a hierarchical vocabulary of human and machine-readable data use terms that consistently and unambiguously represents a dataset's allowable data uses. DUO has been implemented by major international stakeholders such as the Broad and Sanger Institutes and is currently used in annotation of over 200,000 datasets worldwide. Using DUO in data management and access facilitates researchers' discovery and access of relevant datasets. DUO annotations increase the FAIRness of datasets and support data linkages using common data use profiles when integrating the data for secondary analyses. DUO is implemented in the Web Ontology Language (OWL) and, to increase community awareness and engagement, hosted in an open, centralized GitHub repository. DUO, together with the GA4GH Passport standard, offers a new, efficient, and streamlined data authorization and access framework that has enabled increased sharing of biomedical datasets worldwide.

11.
Eur J Hum Genet ; 29(9): 1325-1331, 2021 09.
Article in English | MEDLINE | ID: mdl-34075208

ABSTRACT

For the first time in Europe hundreds of rare disease (RD) experts team up to actively share and jointly analyse existing patient's data. Solve-RD is a Horizon 2020-supported EU flagship project bringing together >300 clinicians, scientists, and patient representatives of 51 sites from 15 countries. Solve-RD is built upon a core group of four European Reference Networks (ERNs; ERN-ITHACA, ERN-RND, ERN-Euro NMD, ERN-GENTURIS) which annually see more than 270,000 RD patients with respective pathologies. The main ambition is to solve unsolved rare diseases for which a molecular cause is not yet known. This is achieved through an innovative clinical research environment that introduces novel ways to organise expertise and data. Two major approaches are being pursued (i) massive data re-analysis of >19,000 unsolved rare disease patients and (ii) novel combined -omics approaches. The minimum requirement to be eligible for the analysis activities is an inconclusive exome that can be shared with controlled access. The first preliminary data re-analysis has already diagnosed 255 cases form 8393 exomes/genome datasets. This unprecedented degree of collaboration focused on sharing of data and expertise shall identify many new disease genes and enable diagnosis of many so far undiagnosed patients from all over Europe.


Subject(s)
Genetic Diseases, Inborn/genetics , Information Dissemination , Intersectoral Collaboration , Rare Diseases/genetics , Consensus Development Conferences as Topic , Europe , Genetic Diseases, Inborn/diagnosis , Genetic Testing/methods , Humans , Rare Diseases/diagnosis , Exome Sequencing/methods
12.
Anal Biochem ; 626: 114124, 2021 08 01.
Article in English | MEDLINE | ID: mdl-33607059

ABSTRACT

We report proof-of-principle experiments regarding a dynamic microarray protocol enabling accurate and semi-quantitative DNA analysis for re-sequencing, fingerprinting and genotyping. Single-stranded target molecules hybridise to surface-bound probes during initial gradual cooling with high-fidelity. Real-time tracking of target denaturation (via fluorescence) during a 'dynamic' gradual heating phase permits 'melt-curve' analysis. The probe most closely matching the target sequence is identified based on the highest melting temperature. We demonstrated a >99% re-sequencing accuracy and a potential detection rate of 1% for SNPs. Experiments employing Hypericum ribosomal ITS regions and HIV genomes illustrated a reliable detection level of 5% plus simultaneous re-sequencing and genotyping. Such performance suggests a range of potential real-world applications involving rapid sequence interrogation, for example, in the Covid-19 pandemic. Guidance is offered towards the development of a commercial platform and dedicated software required to bring this technique into mainstream science.


Subject(s)
COVID-19/genetics , Genome, Plant , Genome, Viral , Genotyping Techniques , HIV-1/genetics , Hypericum/genetics , Oligonucleotide Array Sequence Analysis , Software , COVID-19/epidemiology , Humans
13.
BMJ Open ; 11(1): e047101, 2021 01 19.
Article in English | MEDLINE | ID: mdl-33468531

ABSTRACT

INTRODUCTION: Multimorbidity is widely recognised as the presence of two or more concurrent long-term conditions, yet remains a poorly understood global issue despite increasing in prevalence.We have created the Wales Multimorbidity e-Cohort (WMC) to provide an accessible research ready data asset to further the understanding of multimorbidity. Our objectives are to create a platform to support research which would help to understand prevalence, trajectories and determinants in multimorbidity, characterise clusters that lead to highest burden on individuals and healthcare services, and evaluate and provide new multimorbidity phenotypes and algorithms to the National Health Service and research communities to support prevention, healthcare planning and the management of individuals with multimorbidity. METHODS AND ANALYSIS: The WMC has been created and derived from multisourced demographic, administrative and electronic health record data relating to the Welsh population in the Secure Anonymised Information Linkage (SAIL) Databank. The WMC consists of 2.9 million people alive and living in Wales on the 1 January 2000 with follow-up until 31 December 2019, Welsh residency break or death. Published comorbidity indices and phenotype code lists will be used to measure and conceptualise multimorbidity.Study outcomes will include: (1) a description of multimorbidity using published data phenotype algorithms/ontologies, (2) investigation of the associations between baseline demographic factors and multimorbidity, (3) identification of temporal trajectories of clusters of conditions and multimorbidity and (4) investigation of multimorbidity clusters with poor outcomes such as mortality and high healthcare service utilisation. ETHICS AND DISSEMINATION: The SAIL Databank independent Information Governance Review Panel has approved this study (SAIL Project: 0911). Study findings will be presented to policy groups, public meetings, national and international conferences, and published in peer-reviewed journals.


Subject(s)
Multimorbidity , State Medicine , Cohort Studies , Epidemiologic Studies , Female , Humans , Information Storage and Retrieval , Male , Wales/epidemiology
14.
Cell Genom ; 1(2)2021 Nov 10.
Article in English | MEDLINE | ID: mdl-35072136

ABSTRACT

The Global Alliance for Genomics and Health (GA4GH) aims to accelerate biomedical advances by enabling the responsible sharing of clinical and genomic data through both harmonized data aggregation and federated approaches. The decreasing cost of genomic sequencing (along with other genome-wide molecular assays) and increasing evidence of its clinical utility will soon drive the generation of sequence data from tens of millions of humans, with increasing levels of diversity. In this perspective, we present the GA4GH strategies for addressing the major challenges of this data revolution. We describe the GA4GH organization, which is fueled by the development efforts of eight Work Streams and informed by the needs of 24 Driver Projects and other key stakeholders. We present the GA4GH suite of secure, interoperable technical standards and policy frameworks and review the current status of standards, their relevance to key domains of research and clinical care, and future plans of GA4GH. Broad international participation in building, adopting, and deploying GA4GH standards and frameworks will catalyze an unprecedented effort in data sharing that will be critical to advancing genomic medicine and ensuring that all populations can access its benefits.

16.
Respir Res ; 21(1): 183, 2020 Jul 14.
Article in English | MEDLINE | ID: mdl-32664956

ABSTRACT

BACKGROUND: Airway bacterial dysbiosis is a feature of chronic obstructive pulmonary disease (COPD). However, there is limited comparative data of the lung microbiome between healthy smokers, non-smokers and COPD. METHODS: We compared the 16S rRNA gene-based sputum microbiome generated from pair-ended Illumina sequencing of 124 healthy subjects (28 smokers and 96 non-smokers with normal lung function), with single stable samples from 218 COPD subjects collected from three UK clinical centres as part of the COPDMAP consortium. RESULTS: In healthy subjects Firmicutes, Bacteroidetes and Actinobacteria were the major phyla constituting 88% of the total reads, and Streptococcus, Veillonella, Prevotella, Actinomyces and Rothia were the dominant genera. Haemophilus formed only 3% of the healthy microbiome. In contrast, Proteobacteria was the most dominant phylum accounting for 50% of the microbiome in COPD subjects, with Haemophilus and Moraxella at genus level contributing 25 and 3% respectively. There were no differences in the microbiome profile within healthy and COPD subgroups when stratified based on smoking history. Principal coordinate analysis on operational taxonomic units showed two distinct clusters, representative of healthy and COPD subjects (PERMANOVA, p = 0·001). CONCLUSION: The healthy and COPD sputum microbiomes are distinct and independent of smoking history. Our results underline the important role for Gammaproteobacteria in COPD.


Subject(s)
Lung/microbiology , Non-Smokers , Pulmonary Disease, Chronic Obstructive/microbiology , Smokers , Sputum/microbiology , Aged , Case-Control Studies , Dysbiosis , England , Female , Health Status , Humans , Male , Middle Aged , Pulmonary Disease, Chronic Obstructive/diagnosis , Ribotyping
17.
Alzheimers Res Ther ; 12(1): 8, 2020 01 06.
Article in English | MEDLINE | ID: mdl-31907067

ABSTRACT

BACKGROUND: Recruitment is often a bottleneck in secondary prevention trials in Alzheimer disease (AD). Furthermore, screen-failure rates in these trials are typically high due to relatively low prevalence of AD pathology in individuals without dementia, especially among cognitively unimpaired. Prescreening on AD risk factors may facilitate recruitment, but the efficiency will depend on how these factors link to participation rates and AD pathology. We investigated whether common AD-related factors predict trial-ready cohort participation and amyloid status across different prescreen settings. METHODS: We monitored the prescreening in four cohorts linked to the European Prevention of Alzheimer Dementia (EPAD) Registry (n = 16,877; mean ± SD age = 64 ± 8 years). These included a clinical cohort, a research in-person cohort, a research online cohort, and a population-based cohort. Individuals were asked to participate in the EPAD longitudinal cohort study (EPAD-LCS), which serves as a trial-ready cohort for secondary prevention trials. Amyloid positivity was measured in cerebrospinal fluid as part of the EPAD-LCS assessment. We calculated participation rates and numbers needed to prescreen (NNPS) per participant that was amyloid-positive. We tested if age, sex, education level, APOE status, family history for dementia, memory complaints or memory scores, previously collected in these cohorts, could predict participation and amyloid status. RESULTS: A total of 2595 participants were contacted for participation in the EPAD-LCS. Participation rates varied by setting between 3 and 59%. The NNPS were 6.9 (clinical cohort), 7.5 (research in-person cohort), 8.4 (research online cohort), and 88.5 (population-based cohort). Participation in the EPAD-LCS (n = 413 (16%)) was associated with lower age (odds ratio (OR) age = 0.97 [0.95-0.99]), high education (OR = 1.64 [1.23-2.17]), male sex (OR = 1.56 [1.19-2.04]), and positive family history of dementia (OR = 1.66 [1.19-2.31]). Among participants in the EPAD-LCS, amyloid positivity (33%) was associated with higher age (OR = 1.06 [1.02-1.10]) and APOE ɛ4 allele carriership (OR = 2.99 [1.81-4.94]). These results were similar across prescreen settings. CONCLUSIONS: Numbers needed to prescreen varied greatly between settings. Understanding how common AD risk factors link to study participation and amyloid positivity is informative for recruitment strategy of studies on secondary prevention of AD.


Subject(s)
Alzheimer Disease/prevention & control , Patient Selection , Aged , Amyloidogenic Proteins/metabolism , Brain/pathology , Cohort Studies , Female , Humans , Longitudinal Studies , Male , Middle Aged , Registries , Risk Factors
18.
Nucleic Acids Res ; 48(D1): D933-D940, 2020 01 08.
Article in English | MEDLINE | ID: mdl-31612961

ABSTRACT

The GWAS Central resource provides a toolkit for integrative access and visualization of a uniquely extensive collection of genome-wide association study data, while ensuring safe open access to prevent research participant identification. GWAS Central is the world's most comprehensive openly accessible repository of summary-level GWAS association information, providing over 70 million P-values for over 3800 studies investigating over 1400 unique phenotypes. The database content comprises direct submissions received from GWAS authors and consortia, in addition to actively gathered data sets from various public sources. GWAS data are discoverable from the perspective of genetic markers, genes, genome regions or phenotypes, via graphical visualizations and detailed downloadable data reports. Tested genetic markers and relevant genomic features can be visually interrogated across up to sixteen multiple association data sets in a single view using the integrated genome browser. The semantic standardization of phenotype descriptions with Medical Subject Headings and the Human Phenotype Ontology allows the precise identification of genetic variants associated with diseases, phenotypes and traits of interest. Harmonization of the phenotype descriptions used across several GWAS-related resources has extended the phenotype search capabilities to enable cross-database study discovery using a range of ontologies. GWAS Central is updated regularly and available at https://www.gwascentral.org.


Subject(s)
Computational Biology/methods , Databases, Genetic , Genome-Wide Association Study , Genomics , Genotype , Phenotype , Software , Gene Ontology , Genome-Wide Association Study/methods , Genomics/methods , Software Design , User-Computer Interface , Web Browser
SELECTION OF CITATIONS
SEARCH DETAIL
...